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Abstract 

This paper examines possible psycholinguistic mechanisms governing stem 
vowel changes of irregular verbs in intermediate English learners of German 
as a foreign language (GFL). In Experiment 1, nonce-infinitives embedded in 
an authentic fictional text had to be inflected for German preterite, thus test¬ 
ing possible analogy-driven pattern associations. Experiment 2 explored the 
psycholinguistic reality of the so-called apophonic path by prompting two in¬ 
flections for one given nonce-word. Data were analyzed using generalized 
mixed effects models accounting for within-subject as well as within-item var¬ 
iance. The results of Experiment 1 and 2 support the notion of a pattern asso- 
ciator and yield only scarce evidence for the psycholinguistic reality of a uni¬ 
versal apophonic path. Therefore, the organization of irregular verb morphol¬ 
ogy in the mental lexicon of intermediate GFL learners might best be captured 
by the linguistic notion of structured lexical entries as well as the psycholin¬ 
guistic mechanism of an analogy-based pattern associator. 

Keywords : mental lexicon; irregular verb morphology; analogy; pattern asso¬ 
ciator; apophony 
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1. Introduction 

After around 130 years of research into the human mental lexicon (Whitaker, 
2006), the semi-regular paradigm of German irregular verbs, combining rule-like 
as well as idiosyncratic processes, has become the fruit fly in psycholinguistic 
research. It is thought to offer invaluable insights into storage and retrieval of 
words in the human mental lexicon. In fact, these insights promise to shed light 
on the nature of human cognition in general. 

Over the last two decades, German irregular verb morphology has been 
explored from various perspectives. Data come from behavioral LI studies of 
both children and adult speakers (Clahsen, Hadler, & Weyerts, 2004; Smolka, 
Zwitserlood, & Rosier, 2007), impaired speakers studies (M arusch, von der M als- 
burg, Bastiaanse, & Burchert, 2012; Penke, Wimmer, Hennies, Hess, & Roth- 
weiler, 2014), electrophysiological studies (Smolka, Khader, Wiese, Zwitserlood, 
& Rosier, 2013), neuroimaging (Luck, Hahne, & Clahsen, 2006), constructivist 
neural network modelling (Ruh & Westermann, 2008) and corpus studies (Ko- 
pcke, 1998). The above studies focus on processing differences, reporting and 
discussing possible evidence for qualitatively distinct mechanisms of inflection. 
While some findings suggest that all German verbs are inflected by the same 
psycholinguistic mechanism, there is evidence pointing towards two qualita¬ 
tively distinct processing routes in the speakers' mental lexicon. Overall, current 
findings appear to remain inconclusive in this respect. 

M ost studies concur, though, that some sort of pattern- and analogy-driven 
mechanisms would be necessary to handle a key component of German verb in¬ 
flection, namely vowel change, a systematic stem vowel alternation in verbal in¬ 
flection. Like in English, a lot of irregular German verbs exhibit such stem vowel 
alternations when inflected for the German past tense and past participle, such 
as singen-sang-gesungen 'sing-sang-sung.' Modern single-route approaches 
model vowel change through analogical or connectionist networks, while the 
dual-route model accounts for vowel change by means of a so-called pattern as- 
sociator. The psycholinguistic reality of such a pattern associator has been ex¬ 
plored in LI speakers in a number of studies (Bybee & Moder, 1983; Bybee & 
Slobin, 1982; Prasada & Pinker, 1993), but, unlike English, German as a foreign 
language (GFL) behavioral evidence is scarce. While recent studies focus on pro¬ 
cessing differences (Hahne, M uller, & Clahsen, 2006; Neubauer & Clahsen, 2009; 
Pliatsikas & M arinis, 2013; Strobach & Schonpflug, 2011), the intricacies of Ger¬ 
man vowel change remains largely unexplored. In a similar vein, Strobach and 
Schonpflug (2011) concluded that prior behavioral evidence appears inconclusive 
at best, suggesting further research was needed especially with regard to L2. 
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The present study tries to fill this gap. It reports findings from two well-es¬ 
tablished nonce-word research designs in which 82 intermediate English GFL 
learners provided preterite, participle, and infinitive formations to given German 
nonce-words. The rationale behind this study was to test in how far GFL vowel 
change was governed by a pattern associator, operating on prototypical schemas 
(Bybee & Slobin, 1982; Kopcke, 1998; Pinker, 1999), or by the so-called universal 
apophonic path (Segeral & Scheer, 1998). If universal apophony were psycholin- 
guistically real, GFL interlanguage patterns could not be attributed to transfer phe¬ 
nomena but would instead be the result of universal developmental effects. 

2. German irregularverb morphology and models of inflection 

There are about 160 simplex irregular verbs in modern German, although, depend¬ 
ing on the choice of reference corpora, counts vary considerably. While M iddle High 
German exhibited about 400 irregulars, the number decreased steadily, yielding to¬ 
day's moderate type-frequency of roughly 4% of all verbs (Kopcke, 1998; Bittner, 
1996). Their token-frequency, however, is remarkable, and roughly resembles that 
of the regulars (Clahsen, 1997; Clahsen, EisenbeiS, & Sonnenstuhl-Henning, 1997). 

The verbal paradigm in German is traditionally categorized as consisting of 
regular (weak) and irregular (strong) forms, combining rule-based affixation with 
irregular stem vowel changes. German participles, for instance, involve three mor¬ 
phological processes: a prosodically constrained ge- prefixation, optional stem 
vowel allomorphy, as well as -(e)n or -t suffixation, both occurring with roughly 
the same token frequency in adult and child corpora (Clahsen, 1997). Recently, 
however, analyses have become more complex with regard to form and function 
of stem vowel change or apophony within the paradigm (Trompelt, Bordag, & 
Pechmann, 2013; Wiese, 2008), resulting in regular, hybrid, and irregular classes. 
Table 1 illustrates the most common inflectional patterns. 


Table 1 German vowel change patterns 



Regular, one stem 
vowel only 

Hybrid, change in 
participle and pret¬ 
erite 

Irregular, change in pre¬ 
sent, preterite, and par¬ 
ticiple 

M ixed, vowel 
change H-suffix 

Infinitive 

spiel-en 'to play' 

trink-en 'to drink’ 

sprech-en 'to speak’ 

renn-en 'to run' 

Present tense 

spiel-t' plays' 

trink-t 'drinks’ 

sprich-t 'speaks’ 

renn-t'runs' 

3rd person 

Preterite 3rd person 

spiel-te 'played' 

trank'drank' 

sprach 'spoke' 

rann-te 'ran' 

participle 

ge-spiel-t 'played' 

ge-trunk-en 'drunk' 

ge-sproch-en 'spoken' 

ge-rann-V run' 


German stem vowel alternationsfor preterite and participle inflection form 
a patterned semi-regular paradigm. From a diachronic perspective, these patterns 
area mere artefact of older variants (Bybee & Newman, 1995) and appear opaque 
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and unpredictable (Nubling, Dammel, Duke, & Szczepaniak, 2006), once even 
apostrophized as "one of the classic chestnuts of morphological analysis" (Ander¬ 
son, 1988, p. 157). 

Due to this unpredictability, German irregular verbs exhibit restricted gen¬ 
eralization properties, and there is little consensus about the distinctiveness of 
attested vowel change classes. The phonological patterns, for example, are nei¬ 
ther necessary nor sufficient to predict to which class a verb should conform. A 
verb such as sinken 'to sink,' for instance, readilyjoins the irregular pattern dis¬ 
played by trinken 'drink' (trinken : trank : getrunken :: sinken : sank : gesunken), 
while winken 'to wave' and blinken 'to blink' do not. Accordingly, attempts to 
define vowel change categories vary considerably, ultimately treating almost 
each and every variant of vowel change as a class of its own. 

German verb morphology has been a controversial object of enquiry in psy¬ 
cholinguistics. Studies throughout the last 30 years have resulted in the long-stand¬ 
ing past-tense-debate (Pinker & Ullman, 2002; Wagner, 2010). From this debate, 
two approaches emerged. Single-route models suggest one mechanism to account 
for storage and retrieval of both regular and irregular forms in the mental lexicon. 
Such mechanisms are either symbolic or non-symbolic in nature. Symbolic models 
(Albright & Hayes, 2003; Bittner, 1996) are inherently rule-based and deterministic, 
which creates a number of disadvantages: They are not, for instance, convincing in 
capturing vowel change as a concatenative process, they do not allow for a gradual 
transition between regular and irregular mechanisms, and they often lack the psy¬ 
cholinguists underpinning from behavioral data (Becker, 1990). Analogical learners 
and connectionist networks, in contrast, model both regular and irregular verbal 
inflection entirely rule-free, conceiving linguistic representations in the mental lexi¬ 
con as graded and domain-general phenomena (Eddington, 2000; Goebel & Inde- 
frey, 2000; Westermann, Willshaw, & Penke, 1999). Connectionist models do this 
by cyclically building and relating weighted connections of parallelly distributed 
phonological and semantic information about verb infinitives and their inflections. 
Analogical learners store huge databases of individual exemplars and calculate gen¬ 
eralization properties among these on the basis of near-neighbourhood similarities. 

Trying to synthesize symbolic and associationist processing, the dual-route 
model assigns different mechanisms to regular and irregular morphology. Regular 
verbs would be processed by abstract symbolic rules, while irregular and thus lex- 
icalized German verbs would be stored undercomposed in memory and pro¬ 
cessed by associative patterning, not unlike what Ullman (2001) called declarative 
knowledge. Such an architecture would, as proponents of this approach claim, 
represent the two core characteristics of human cognition (Clahsen, 1999; Luck, 
Hahne, & Clahsen, 2006; Pinker, 1999). 
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3. German irregularverb morphology in L2 acquisition 

German verb morphology has not only been controversial in psycholinguistics, 
but it has also been the bane of countless language learners (Neubauer & Clah- 
sen, 2009; Pinker, 1999). In order to trace possible difficulties in the acquisition 
of German irregularverb morphology, studies looked at processing differences. 
However, as with LI speakers, there is conflicting evidence. Advanced Greek 
learners of German, for instance, employed two different mechanisms for regu¬ 
lar and irregular inflection (Pliatsikas & M arinis, 2013), while Polish learners re¬ 
lied more on lexical storage and retrieval compared to German LI speakers 
(Neubauer & Clahsen 2009). And Strobach and Schonpflug (2011) report that 
their data from English learners of German were consistent with a single-route 
connectionist account. GFL vowel change, however, has not been examined in 
detail yet. For English L2, there is evidence in favour of an analogical or proto¬ 
typical organization in the mental lexicon of advanced German learners of Eng¬ 
lish (Wagner, 2010). At the heart of such prototype models are output-oriented 
morphophonological schemas, governing each vowel change class. Such classes 
emerge through exemplars sharing various morphophonological properties to 
varying degrees, thus following the principles of family resemblance (Rosch, 
1975). For German LI, prototypical schemas have been suggested, too (Bybee, 
1995; Wagner, 2010), but there is evidence that, contrary to English, analogical 
formations in German rely almost exclusively on a verb's rhyme (Penke, 2006). 

An intriguing alternative to prototypical schemas comes from Segeral and 
Scheer's (1998) universal apophonic path. They claim that apophony across all 
types of languages followed a unidirectional five-part vowel change sequence 
[0->i->a->u->u], providing all the necessary information in order to predict 
German stem alternations. This is possible only because the authors sophisticat¬ 
ed^ reanalyze the German verb paradigm as consisting of an infra-segmental 
level. The alleged universality of their path is challenging, though, especially 
since both English and German provide additional supportive evidence from 
children's nursery rhymes, onomatopoeic expressions, and expletives. Amongst 
these, apophonic vowel changes such as [i-a] are remarkably frequent. The pat¬ 
tern [i-a-u] is, in fact, the most frequent type of a three-vowel-change pattern 
in German. If this apophonic path were to be psycholinguistically real, it would 
seriously challenge both single- and dual-route-models. 

The two nonce-word elicitation experiments reported in this paper are 
supposed to test these competing approaches. If a universal vowel change 
mechanism were psycholinguistically real, we should see learners of German (a) 
predominantly, or even exclusively, use vowel changes faithful to the apophonic 
path, and (b) adhere to its mono-directionality, thus rejecting vowel-changes 
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prompted in the reverse direction. If German verb morphology were governed 
by prototypical schemas, we should expect (a) a strong influence of the verb's 
constituents on vowel change, (b) a complete overlap of the prototypical test 
item with the statistically most effective constituents, and (c) an insensitivity 
towards directionality effects. 

4. General method 

4.1. Overview 

Both experiments employed a repeated measure elicitation task, using nonce- 
words either embedded in an authentic fictional text or as part of a sentence 
completion frame. Although nonce-word elicitation cannot be controlled as 
thoroughly as, for instance, priming, lexical decision or eye-movement experi¬ 
ments, this research paradigm was used because of its natural setting and its 
long history in psycholinguistics (Berko, 1958; Bybee & M oder, 1983; Lemhofer 
& Radach, 2009). In Experiment 1, 34 participants inflected 28 nonce infinitives 
and 15 distractorsfor preterite. In Experiment 2,48 participants inflected either 
28 irregular nonce-infinitives for preterite and participle, or 28 nonce preterites 
for infinitive and participle. The order in which items were presented varied 
across the test questionnaires in both experiments, thus helping counterbalance 
possible sequence effects (Prasada & Pinker, 1993; Ramscar, 2002). 

4.2. Stimuli 

Based on the lexical statistics reported in Kopcke (1998), the 28 most prototypical 
vowel change triggering nonces were filtered out of 344 phonotactically well- 
formed constituent combinations. A subset of Experiment 2 used preterite forms 
by transposing the infinitive stems [i:] and [i] into [a]-, [o:]-, or [oj-vowel-change 
preterites, depending on the most frequent analogies to existing verbs. Thus, a 
nonce like strieften would result in stross by analogy to schlieBen 'to close, to lock;' 
schloss 'closed, locked.' In both experiments, test items' constituent variants in¬ 
cluded onset structures C, CC, [J], [J]C, and [J]CC, as well as coda variants [g], 
[m/m], [p], [rjk], and [s/g]. All stimuli can be found in the appendixto this paper. 

4.3. Participants 

Data for Experiment 1 came from randomly recruited GFL speakers of the Uni¬ 
versity of M aynooth, University of Edinburgh, and University of Leicester (N = 
34). Data for Experiment 2 were randomly sampled among GLF speakers from 
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Dublin City University, Trinity College Dublin, the University of M anchester, and 
University of Hull (N =48). Random sampling was done within one cohort of the 
respective study programs involving German, and participants were not nested 
within classes. The test questionnaires were administered in small groups. 
Among the 82 participants, there were 42 females and 40 males. They were all 
Irish or British citizens, their LI was English, there were no bilinguals, and they 
had been residents in their respective countries from birth. The mean age of the 
participants was 19 years (SD =1.19), with a range from 17 to 21 years. They all 
had been learning German in instructional settings as part of their school cur¬ 
ricula for more than 4 years and approached intermediate level B1 (Council of 
Europe, 2001), They all studied German as part of their respective undergradu¬ 
ate degree programs. All subjects were unimpaired speakers. They did not re¬ 
ceive any remuneration for their participation. 

4.4. Procedure 

For Experiment 1, participants were told that their data would feed into a new trans¬ 
lation of Burgess' classic novel A clockwork orange (Burgess, 1962). The introduction 
pretended that the learners' intuition was deemed instrumental in finding adequate 
translations of the notorious A/acfsat-slang verbs. Using such a real novel extract, rid¬ 
dled with artificial slang, was meant to avoid unwanted semantic associations and 
distract the attention away from the linguistic details. Overall, great care was taken 
to leave subjects in Experiment 1 deliberately naive to the real purpose of the exper¬ 
iment. After two items for practice, an audio CD provided the text with the intended 
pronunciation of the nonces as well as the timing for the gap-filling. Contrary to Pra- 
sada and Pinker (1993), and following Orsolini and Marslen-Wilson (1997), it was 
deemed appropriate to present the stimuli only once and elicit only one response to 
a particular test item in both experiments. This was supposed to discourage strategic 
approaches and tap into retrieval processes as directly as possible. 

The questionnaire of Experiment 2 informed participants that they were taking 
part in a genuine linguistic experiment about how German verbs are stored and or¬ 
ganised in the mental lexicon. Then they were told that, in order to make the experi¬ 
ment more interesting, they would be asked to creatively inflect non-existing forms; 
thus, they were encouraged to rely on their linguistic intuition about the appropriate 
sound of the inflection. One subset of the participants in Experiment 2 were asked to 
inflect given infinitivesfor preterite and participle, while the other one was prompted 
to inflect given preterite forms for infinitive and participle. Prompting one subset to 
inflect backward was supposed to test directionality effects. Recall that the apophonic 
path defines rule-like input-output derivations of one vowel quality out of another. 
Such derivations are assumed to be directional, and hence they are not expected to 
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work in the reverse order. In other words, if the apophonic path was real, English 
learners of German should prove to be able to complement a fragmentary apophonic 
path such as [i->x] with ease, whereas the reverse direction such as in [x<-a] should 
prevent speakers from systematic vowel change inflections. 

Using nonce participles for testing the apophonic path's directionality 
proved inadequate since, contrary to English, in German the choice of possible 
preterite inflectionsfrom a given participle in a reverse order is rather restricted. 
A given [u] as a participle stem, for instance, would almost automatically elicit 
[a] in the preterite, since [i-a-u] isthe by far the most predominant German pat¬ 
tern. In order to test possible learners' preferences, German preterites and not 
participles were given. After two items for practice, a CD provided all sentence 
frames, thereby timing the filling of the gaps and controlling potential confusion 
about the pronunciation of a verb (like [d] or [o:] in an item such as plog). 

4.5. Data coding and analysis 

Test items in both experiments were coded for onset, nucleus, and coda variants, 
covering onset structures C, CC, [J], [J]C, and [J]CC, two nuclei, [i:] and [i], as well 
as codas [g], [m/n], [rj], [qk], and [s/p]. After inspection of frequency tables, on¬ 
set was collapsed into [J] with optional consonant versus consonant only, and 
coda into consonant, velar, and fricative. 

Responses in Experiment 1 were coded for vowel changes in [a], [o], and [u], 
mixed inflections, and the residual category others. Wherever ATR contrasts were 
irrelevant for the present analyses, vowel variants such as [a:] versus [a], [i:] versus 
[i], [o:] versus [d], and [u:] versus [a] were collapsed into [a], [i], [o], and [u] (Wiese, 
2000). Responses in Experiment 2 were coded according to the type of vowel 
change series they produced as well as their well-formedness. Data were analyzed 
using generalized mixed regression models as well as non-parametric, conditional 
inference trees built through recursive partitioning. Both analyses were done using 
the statistical software R, version 3.3.1 (R Core Team, 2016). M ixed modelling was 
done using the package \me4 (Bates, M aechler, Bolker, & Walker, 2015), and recur¬ 
sive partitioning employed the party package (Hothorn, Hornik, & Zeileis, 2006). 
Although multifactorial ANOVAs appear to have been the predominant tool of 
choice in numerous LI and L2 studies related to the past tense debate, mixed models 
are much more suited to the present data (Cunnings & Finlayson, 2015; Cunnings & 
Linck, 2015; Gries, 2015; Jaeger, 2008). They can, for instance, handle non-orthog- 
onal, unbalanced, and nested designs, typical of repeated measure experiments. 
They can also directly model dichotomous dependent variables and avoid unwar¬ 
ranted assumptions about sphericity, and they simultaneously incorporate random 
variance of test item and subject (Baayen, Davidson, & Bates, 2008). 
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5. Results and discussion of Experiment 1 

In Experiment 1, participants produced, apart from regular inflections (41%), 
three vowel change patterns. The most prominent one was [i-a], with more than 
28%, followed by [i-o] with around 15%, and vowel changes in [u] with 4%. M ixed 
inflections also covered around 4%. Overall, less than 50% of the responses follow 
the default, and only 7% of the responses are random. Both the high amount of 
non-default responses and the high amount of attested vowel changes are re¬ 
markable. The nonces spingen, stingen, and schingen attracted the highest vowel 
change response frequencies, producing predominantly [a]-vowel-change. This 
could be taken as a first indication of a schema with the form [J (C)_i_rj] being 
prototypical for the most productive vowel change class. In order to investigate 
whether this overall schema did in fact contain the most prototypical constitu¬ 
ents, we first looked at the influence of each constituent on the choice between 
regular and vowel-changing preterites (excluding the residual others ). The mosaic 
plot in Figure 1 illustrates the distribution of all preterites by constituents. 


preterite over onset and coda preterite over nucleus and coda 

cons velar.nasal 0 ^ 3 fricative cons coda3 ve | ar _nasal fricative 



Figure 1 Mosaic plots of two log-linear independence models illustrating the 
choice of preterite by onset and coda (left panel), and preterite by nucleus and 
coda (right panel) 

In the mosaic diagrams, both horizontal and vertical asymmetries indicate 
significant partial effects of the constituents on the type of preterite formation (p 
c.001). Size and shading of each tile in the diagrams represent frequencies as well 
as direction and significance of effects. The darkest grey shade indicates more ob¬ 
servations than expected. In the left panel, the dark tiles in the middle and top- 
right illustrate significant effects for the schemas [JC_n(k)] and [C fricative]. Onset 
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has almost no effect. In the right panel, effects for [#_i:_ rj(k)] and [#_i_fricative] 
are visible (middle and top-right dark tiles). In order to explore the above effects 
in more detail, we performed a general linear mixed effects analysis of the rela¬ 
tionship between the constituent variants and the binary choice of preterite. To 
estimate parameters, restricted maximum likelihood was used instead of penal¬ 
ized quasi-likelihood since the latter is reported to produce biased estimates with 
binary response variables (Thiele & Markussen, 2012). As fixed effects, we en¬ 
tered nucleus and coda without interaction term into the model. Their factor lev¬ 
els were treatment-coded, so coefficients in the model correspond to simple ef¬ 
fects. Nucleus and coda were not crossed since some combinations, due to pho- 
notactic reasons, were not represented with data points. 

As random effects, we included intercepts for subjects and items as well as 
by-subject and by-item random slopes for coda since the effect of coda seemed to 
significantly vary across subjects and items, too. M oreover, including random slopes 
renders mixed models conservative and minimises a-errors. The dependent varia¬ 
ble was coded binary (regular vs. irregular). M ain effect p values were obtained by 
likelihood ratio tests (all p values <.005). It was also tested if both fixed and random 
effects were highly correlated, and if the model was overdispersed; neither was the 
case. Scaled residuals were distributed fairly symmetrically. 


Table 2 Table of coefficients of the final model, estimated using Laplace approx¬ 
imation as well as the optimiser bound by quadratic approximation (bobyqua) 
instead of the heuristic default Nelder-M ead method 




Logits 

SE 

z 

P 


Intercept 

0.19 

0.35 

0.52 

.60 

Nucleus (effect size =0.06) 

i: 

-0.25 

0.30 

-0.82 

.41 

Codas (effect size =0.28) 

n(k) 

-1.04 

0.31 

-3.39 

<.001*** 


s/g 

-0.59 

0.43 

-1.36 

.17 


Note. Logits are the log-odds of the parameter estimations; SE is the standard error; z-values come 
from the corresponding Wald statistics. 


When building the final model, we first followed Barr, Levy, Scheepers, 
and Tilly (2013) and therefore tried to include all fixed effects with their interac¬ 
tions as well as all random intercepts and slopes. However, adding interactions 
and random slopes other than coda prevented models from converging, most 
likely because there were too little data for the number of parameters to be 
estimated. Given that there still is little consensus as to how to deal with con¬ 
vergence problems (Barr et al., 2013; Cunnings & Finlayson, 2015), these terms 
were dropped again. Table 2 summarises the partial main effects of the constit¬ 
uents' variants of the final model with a pseudo conditional R 2 of 0.62. 
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Table 2 illustrates a highly significant main effect for coda (x 2 (2) =14.28, p 
< .001), and a significant partial effect for codas with the velar nasal plus op¬ 
tional obstruent. In other words, [rj(k)] significantly affects the choice of preter¬ 
ite formations, decreasing the log odds for regulars by -1.04 (SE =0.31), resulting 
in a decreased probability for regular responses of p =.30. Compared to nucleus, 
coda has a substantial effect size (range) of 0.28. Other than that, there are no 
significant partial effects. However, if we plot nucleus and coda as interactions, 
we can see the results presented in Figure 2. When looking at the simple main 
effects (left panel), we can see that the probability of regulars slightly decreases 
when we go from short (0.41) to long (0.35) nucleus. As for codas, probabilities 
of regulars decrease for velars (0.28) and fricatives (0.37) compared to other 
consonants (0.52). Note, though, the overlap of the error bars in both panels, 
indicating that we are dealing with merely mild effects. In the right panel we can 
see a substantial decline in probability for regulars (down to 0.30) for short nu¬ 
cleus [i] followed by velars (solid line). Similar, but less pronounced, is the de¬ 
cline for long nucleus [i:] followed by fricatives (0.34, dashed line). In sum, the 
mixed effects analysis revealed effects for [#_i_n(k)] as well as [#_i:_fricative]. 

nucleus effect plot coda3 effect plot 

nucleus*coda3 effect plot 



Figure 2 Effects plots for the individual partial effects in the final model (left 
panel) and the interaction of nucleus and coda (right panel) 


The choice between the different vowel changes in the preterites the learn¬ 
ers produced was modelled using conditional inference trees built through recur¬ 
sive partitioning. Figure 3 shows such a tree model for the different types of vowel 
change by onset, nucleus, and coda. 

The tree in Figure 3 is the result of a split algorithm. Unlike traditional classifica¬ 
tion and regression tree modelling, however, in which algorithms attempt to increase 
information gain measures, conditional inference tree modelling uses inferential test 
statistics in order to retain significant predictors only. As a consequence, the above tree 
model does not contain any covariates that are independent of the choice of vowel 
change patterns. Tree growth is thus based on statistical stopping rules, so that neither 
pruning nor cross-validation are required, and the data cannot be overfitted. 
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a mix o reg u a mix o reg u a mix o reg u a mix o reg u 


Figure 3 Conditional inference tree for the types of vowel change by onset, nu¬ 
cleus, and coda 

The algorithm finds only one significant predictor, namely coda (topmost 
split in the tree, Node 1). This is in line with the mixed model above, where the 
only significant main effect occurred with coda, too. In the tree diagram, we can 
see that the distinction between nasal coda structures on the one hand (Node 2 
on the left) and the fricatives and the velar obstruent on the other is highly signif¬ 
icant (p < .001). The final level of the tree provides the proportional frequencies 
for the four kinds of vowel change. We find the highest proportion of regular re¬ 
sponses for fricatives and the velar obstruent (Node 7) as well as codas [m] and 
[n] (Node 3). In contrast, most of the vowel change responses can be found for 
[n(k)L favoring [a] (Node 4), and [s/g], favoring [o] (Node 6). Overall, the tree 
model suggests a highly significant influence for coda on the choice between dif¬ 
ferent vowel changes, with velar nasals favoring [a], and fricatives favoring [o]. 

To sum up, the nonces producing most vowel changes had the schema 
[T(C)_i_q ]. For vowel change versus the default, an independence model showed 
significant effects for [JC_o(k)], [Cfricative], [#_i:_rj(k)]and [#_i_fricative], while the 
generalized mixed model yielded a significant effect for the rhymes [#_i_rj(k)] and 
[#_i:_s /<;]. When predicting the types of vowel change, coda was the only significant 
predictor, with [rj(k)] favoring vowel changes in [a] and fricatives favoring [o]. 

6. Results and discussion of Experiment 2 

In Experiment 2, conditional inference trees modelling the type of vowel change 
also showed effects for coda, with [rj(k)] favoring [i-a-u] patterns, and fricatives 
favoring [i-o-o]. Recall, though, that Experiment 2 was supposed to test whether 
the participants' vowel changes were faithful to the predictions made by the 
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apophonic path. Although there is a good deal of consistency between expected 
and elicited patterns (x 2 (10) = 146.27, p < .001 for given infinitives, x 2 (10) = 
270.43, p < .001 for given preterites), there are quite a few inconsistencies. 


1st response vowel 
change preterite 

[a] 

(231) 

mixed 

(14) 

[o] 

(85) 

[u] 

(21) 


2nd response vowel 
change participle 

[a-o] 

(59) 

[a-u] 

(62) 

[a-a] 

(48) 

others 

(62) 

various 

(15) 

[o-o] 

(57) 

others 

(33) 

[U-U] 

(16) 


1st response vowel 



2nd response vowel 

change infinitive 



change participle 



71 

others 

(20) 

[i-a-o] (144) 

-> 

-> 

regular 

(60) 

-> 

[e-a] 

(36) 




[i-a] 

(28) 



71 

others 

(13) 

[i-a-u] (216) 

-> 

-> 

regular 

(72) 

-> 

[e-a] 

(44) 




[i-a] (87) ! 



71 

others 

(25) 



-> 

regular 

(71) 

[i:o-o] (240) 

-> 

-> 

[i-o] 

(81) 



-> 

[e-o] 

(58) 



il 

[a-u] 

(5) 

1: o: o:] (72) 


-> 

various 

(72) 


Figure 4 Interaction between preterite and participle response (left) and infini¬ 
tive and participle responses (right), excluding regular first responses, with ar¬ 
rows marking the various paths of inflectional combinations 


First, the learners produced 38% unattested vowel changes. For responses 
starting from a given infinitive, this is evidence against the apophonic path. M ore- 
over, 32% of the responses to given nonce preterites were regular. While it is un¬ 
derstandable for given infinitives to prompt learners to apply the default and not 
inflect it by analogy to a phonologically nearest neighbor, it is surprising when it 
comes to the nonce preterites. Those preterites, such as schoch, stang, or schnoss 
obviously lack regular suffixes. The 32% regular responses indicate either that the 
nonce's obvious irregular phonological shape must have been inaccessible to 
quite a few learners, or that the directionality of the apophonic path rules out 
reverse ablaut. However, more than 50% of the responses to a given nonce pret¬ 
erite show vowel change commensurate with attested patterns. This, in turn, is 
evidence against a directional apophonic path since, clearly, a lot of learners de¬ 
veloped grammatical vowel change sequences starting backwards from a given 
nonce preterite. Figure 4 illustrates inconsistencies in further detail. 

What we can see in Figure 4, left panel, is an erosion from [a]-vowel-chang- 
ing patterns in the first (preterite) to the second (participle) response. 231 preter¬ 
ites branch out to four different participle patterns. And both [a]- and [o]-vowel- 
changes create unorthodox forms in the participle (others). On the right, we can 
see that given preterites prompted four different infinitive patterns, which, too, 
branch out heavily into various attested and unattested vowel changes (others). 
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In sum, evidence against the apophonic path lies in the inconsistencies when pro¬ 
ducing vowel change from nonce infinitive to preterite and participles, as well as 
in the analogically formed irregular infinitives from given nonce preterites. 

In order to model the relationship between the nonce given, the two re¬ 
sponses, and the overall grammaticality of the vowel change series, generalized 
mixed models with restricted maximum likelihood were fitted to the data. In all 
models, random intercepts were included for subject and items. Factor levels of 
predictors were treatment-coded. A maximal random-effects-structure with all 
random slopes as well as interactions of fixed effects prevented models from 
converging, so terms were dropped whenever this happened. 

First of all, the tense given turned out to be a significant predictor for the 
first (x 2 (1) =9.57, p < .005) and second response (x 2 (1) =4.76, p < .03), but, 
interestingly, not for the grammaticality of the overall vowel change pattern (x 2 
(1) =0.04, p = .84). In contrast, in a model with the two responses as fixed ef¬ 
fects, both are highly significant (p <.001), It thus appears as though it was not 
the given nonce verb as such that predicted the grammaticality of the overall 
patterns, but the interplay between the two inflections the learners produced. 
This is again evidence against the apophonic path since the stimulus repre¬ 
sented the directionality dimension of the apophonic path and should have re¬ 
sulted in well-formed vowel change patterns for given infinitives only. 

The final model included both the tense given and the two responses as 
fixed effects. Fixed and random effects were tested for multicollinearity. The 
predictors' inflation factor was below 2.5, and the model was not overdispersed. 
Scaled residuals were distributed fairly symmetrically. Table 3 summarises the 
final model, with a pseudo conditional R 2 of 0.78. 


Table 3 Table of coefficients of the final model, estimated using Laplace approx¬ 
imation as well as the optimizer bound by quadratic approximation (bobyqua) 
instead of the heuristic default Nelder-M ead method 




Logits 

SE 

z 

P 


Intercept 

-3.25 

0.41 

-7,96 

<.001*** 

Nonce given (effect size =0.13) 

Preterite 

0,63 

0.48 

1.31 

.19 

1st response (effect size =0.85) 

Ungrammatical 

2.65 

0.39 

6.85 

<001*** 

1st response (effect size =0.01) 

Regular 

-3.76 

0.45 

-8.41 

<001*** 

2nd response (effect size =0.80) 

Ungrammatical 

5.60 

0.44 

12.66 

<001*** 

2nd response (effect size =0.02) 

Grammatical 

0.28 

0.95 

0.29 

.077 


Note. Logits are the log-odds of the parameter estimations; SE is the standard error; z-values come from the 
corresponding Wald statistics; *** p <.001, 


Table 3 shows that the given tense is again insignificant. Ungrammatical first 
responses, compared to grammatical ones (mapped onto the intercept), result, 
however, in a significant increase of log odds by 2.65, and thus in a probability for 
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ungrammatical patterns of 0.93. Likewise, ungrammatical second responses show an 
increase in log odds by 5.6 and a probability of 0.99 for ungrammatical overall pat¬ 
terns. The ungrammatical second response hasthe largest effect size in this model. 

In sum, the faithfulness to attested patterns is the result of the interplay 
between first and second response and not of the directionality prompted by 
the nonce verbs. It can therefore be concluded that the apophonic path cannot 
convincingly account for the present data. 

7. Conclusion 

How far do the data of the two experiments support a schema-based or universal 
apophonic organization of vowel change in GFL speakers? First of all, we do see 
certain constituent combinations, such as [JC_n(k)] and [C fricative], triggering 
non-default inflections. This observation is by no means trivial since apparently 
even a restricted, classroom-based exposure to German makes analogical for¬ 
mations possible already in intermediate learners (Murphy, 2004; Gor& Cherni¬ 
govskaya, 2005 for similar results for learners of Russian). The learners' LI could, 
though, have a positive effect, since English and German irregularverb morphol¬ 
ogy are not all that different (Clahsen, Felser, Neubauer, Sato, & Silva, 2010). 

Second, there is an almost complete overlap between the nonces produc¬ 
ing most of the attested vowel changes in Experiment 1 (spingen, stingen, and 
schingen) and the most effective constituent combinations, thus having the cue- 
validity one would expect from a prototype (Rosch & Mervis, 1975). However, 
contrary to English (Bybee & M Oder, 1983; Bybee & Slobin, 1982), it is predom¬ 
inantly the verbs' rhymes, such as [#_i_n(k)] and [#_i:_s/g], which account for 
vowel change patterns. So far, these findings are only partly in line with proto¬ 
typical schemas. Instead, as Penke (2006) already suggested, the internal organ¬ 
ization of irregular German verb schemata might rely solely on the rhyme. 

Third, the double inflections in Experiment 2 illustrate processes incom¬ 
patible with the apophonic path. Instead, they reveal a certain analogical 
productivity in a reverse fashion (Becker, 1990). It remains unclear, though, why 
there were so many default responses to given irregular nonce preterites. This 
is indeed surprising since both beginning and advanced learners are reported to 
pay considerable attention to stem-changes (Godfroid & Uggen, 2013), are less 
sensitive to a verb's morphological structure (Neubauer & Clahsen, 2009), and 
prefer declarative memory over symbolic processing. Overall, the present Ger¬ 
man L2 data support the notion of an analogical pattern associator, either as 
part of a dual-route model or as an analogy-based single mechanism. 

From a theoretical point of view, it seems as though structured lexical entries 
from minimalist morphology might best provide a theoretical account for German 
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L2 vowel change (Clahsen, 1999; Wunderlich, 1996; Wunderlich & Fabri, 1995). 
Structured lexical entries are organized as hierarchical trees, with nodes and sub¬ 
nodes relating underspecified grammatical features to each other. However, those 
trees might not just organise individual underspecified entries, but the entire par¬ 
adigm, or verb classes, as such. Therefore, inflectional classes, and even regular 
verb morphology, would have a psycholinguistic reality and would not merely be 
the by-product of their members (Trompelt, Bordag, & Pechmann, 2013). 

From a processing point of view, promising avenues for further research 
could lie in micro-rules, analogical learners, connectionist networks, and probabil¬ 
istic models. The efficiency of so-called micro-rules has been proven for English past 
tense (Albright & Hayes, 2003). They argue for a model inductively creating micro¬ 
rules for both regular and irregular verbs. It is difficult, though, to qualitatively dis¬ 
tinguish instance-based input-output rules and analogical formations. Both pro¬ 
cesses might ultimately be just the two sides of the same coin. On the one hand, 
the entirety of exemplars and their related forms create patterns and thus facilitate 
new analogical formations, but, on the other hand, provide the basis for abstract 
micro-level subregularities and thus facilitate deterministic rules with the extent of 
one exemplar (Becker, 1990). In other words, rules could be interpreted as highly 
reinforced representational patterns and schemas (Bybee, 1988). Appropriate alter¬ 
natives to such symbolic approaches might lie in analogical learners (Aha, Kibler, & 
Albert, 1991; Daelemans, Zavrel, van der Sloot, & van den Bosch, 1999; Eddington, 
2004, Skousen, 1989), connectionist networks (Westermann, Willshaw, & Penke, 
1999) or probabilistic models (Albright, 2009; Baayen, 2003; Baayen & Hay, 2005; 
Gor & Chernigovskaya, 2005). Probabilistic models might, in fact, turn out to be par¬ 
ticularly suited since they can incorporate gradience and learning experience. 

From an acquisitional perspective, the most important finding in this study 
is probably that verb inflections by analogy to existing patterns appear to be pos¬ 
sible even at an intermediate learner level. The present GFLlearners have, despite 
limited classroom-based exposure to the foreign language, accumulated enough 
exemplars of the verbal paradigm in their mental lexicons to generalise patterns, 
enabling them to productively handle new linguistic experiences. What, however, 
does this mean for the acquisition of German in instructional settings? On the one 
hand, if learners generally turned out to prefer lexical retrieval by means of ana¬ 
logical generalization, instructions revolving around morphological analyses of 
verbformswould be of little help; instead, massive exposure facilitating analogical 
inference would be called for. In that respect, further research should look into 
possible constraints and threshold levels for such generalization properties. On 
the other hand, English speaking learners of German, in particular, could benefit 
from a crosslinguistic focus on forms in order to make them aware of both the 
similarities and differences between the two verbal paradigms. 
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APPENDIX 

Table of test items and distractors of Experiment 1 and 2 

Experiments 

No. 

1 


2a 



2b 


Test items 

Distractors 

Test items 

Distractors 

Test items 

Distractors 

1. 

knieGen 


knieGen 


knoss 


2. 

schrimmen 


schrimmen 


schramm 


3. 


luschen 


nisseln 


nisselte 

4. 

spinken 


spinken 


spank 


5. 

grieGen 


grieGen 


gross 


6. 


rabotten 


pieben 


piebte 

7. 

stinnen 


stinnen 


stann 


8. 

biechen 


biechen 


boch 


9. 


govoriten 


witten 


wittete 

10. 

strinken 


strinken 


strank 


11. 


vidden 


stiemen 


stiemte 

12. 


loviten 


stitzen 


stitzte 

13. 

schiechen 


schiechen 


schoch 


14. 

schringen 


schringen 


schrang 


15. 


sotteln 


miegeln 


miegelte 

16. 

stiechen 


stiechen 


stoch 


17. 


schmatten 


hitteln 


hittelte 

18. 

strimmen 


strimmen 


stramm 


19. 

spingen 


spingen 


spang 


20. 


pitschen 


sicken 


sickte 

21. 

schliegen 


schliegen 


schlog 


22. 


kritschen 


pitschen 


pitschte 

23. 

schingen 


schingen 


schang 


24. 

fieGen 


fieGen 


foss 


25. 

sprinken 



lichsen 


lichste 

26. 


boppen 


liepen 


liepte 

27. 


schlippen 

sprinken 


sprank 


28. 

fiechen 


fiechen 


foch 


29. 

stingen 


stingen 


stang 


30. 

triechen 


triechen 


troch 


31. 


krasen 


bristen 


bristete 

32. 

strieGen 


strieGen 


stross 


33. 

schinnen 


schinnen 


schann 


34. 

pliegen 


pliegen 


pi°g 


35. 


dengen 


fritteln 


frittelte 

36. 

kingen 


kingen 


kang 


37. 

friegen 


friegen 


frog 


38. 


wetschen 


krietschen 


krietschte 

39. 

linnen 


linnen 


lann 


40. 

schnieGen 


schnieGen 


schnoss 


41. 


harrken 


fietzen 


fietzte 

42. 

schminnen 


schminnen 


schmann 


43. 

frinken 


frinken 


frank 
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